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CO ' Abstract. Given oracle access to a Boolean function / : {0, 1}" i— )■ {0, 1}, we design a 

randomized tester that takes as input a parameter e > 0, and outputs Yes if the function is 
' monotone, and outputs No with probability > 2/3, if the function is £-far from monotone. 

I That is, / needs to be modified at e-fraction of the points to make it monotone. Our 

• non-adaptive, one-sided tester makes 0{n^/^e^^^^) queries to the oracle. 

1. Introduction 

Testing monotonicity of Boolean functions is a classical question in property testing. The 
Boolean hypercube {0, 1}" defines a natural partial with x -< y iS Xi < i/i for all i G [n]. A 
Boolean function / : {0, l}" {0, 1} is monotone if f{x) < f{y) whenever x -< y. 

c/3 ■ 

^ A Boolean function's distance to monotonicity is the minimum fraction of points at which 

it needs to be modified to make it monotone. In the property testing framework, we are 
provided oracle access to the function / and given a parameter e > 0. A monotonicity tester 
\0 • is an algorithm that accepts, if the function is monotone and rejects, if the function is e-far 
, from monotone. The tester is allowed to be randomized, and has to be correct with non- 

Tjj- I trivial probability (say > 2/3). The tester is called one-sided if the tester always accepts a 
^ ' monotone function. The tester is non-adaptive if the queries made by the algorithm do not 
O ■ depend on the answers given by the oracle. 

m 

The quality of a monotonicity tester is governed by the number of oracle queries as well 
as the running time. Goldreich et al. [8] suggested the following simple tester: query the 
^ ■ function value on a pair of points that differ on exactly a single coordinate and reject if 
5^ ■ monotonicity is violated. In other words, the tester samples a random edge of the hypercube 
and checks for monotonicity between the two endpoints. This is called the edge tester for 
monotonicity; it is clear the running time is of the same order as the query time. 

Goldreich et al. [8] show that 0(n/e)-queries by the edge tester suffice to test monotonicity. 
They also show that their analysis is tight, so the edge tester can do no better. They 
explicitly ask whether there exists a tester with an improved query complexity in terms of 
n? Fischer et al. [7] show that any non-adaptive, one-sided tester^ for monotonicity must 
make f2(-(/n)-queries for constant e > 0. While monotonicity has been extensively studied 
in property testing [6, 8, 5, 10, 7, 9, 11, 1, 3, 2, 4], no significant progress had been made on 
this decade old question of testing monotonicity of Boolean functions. 



Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corpo- 
ration, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy's 
National Nuclear Security Administration under contract DE-AC04-94AL85000. 

^[7] also show a fl(logn) lower bound for 2-sided testers. 
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Our main result is an affirmative answer to the above question of [8]. For brevity, we use the 
tilde notation to hide log factors; 0{h) contains functions dominated by h-polj{logh). 

Theorem 1. There exists a one-sided, non-adaptive 0{n^^^e~^^^) -query monotonicity tester 
for Boolean functions f : {0, 1}" t-)- {0, 1}. 

We get an improved bound for functions with low average sensitivity. Given a Boolean 
function /, the influence of dimension i, denoted as Infj, is defined as the fraction of edges on 
the hypercube crossing the ith dimension whose endpoints have different function values. The 
average sensitivity, denoted as is the sum of all the n influences. The functions defined 
in [7] to prove lower bounds of Q{y/n) for non-adaptive, one-sided testers have constant 
average sensitivity, and hence the following is optimal for that setting. 

Theorem 2. There exists a one-sided, non-adaptive 0{y/npo\j(l{f) /e))-query monotonicity 
tester for Boolean functions of average sensitivity I(/). 

1.1. Pair testers. A pair tester [5] describes a fixed distribution (independent of the func- 
tion) on domain pairs {x -< y), makes independent queries on pairs drawn from this distri- 
bution, and rejects iff some drawn pair violates monotonicity. By definition, pair testers are 
non-adaptive and one-sided. (Note that the edge tester is a pair tester.) Brief et al. [3] show 
that any pair tester requires fl{n/{elogn)) samples and the linear dependence on 0{l/e) is 
crucial in their argument. Our tester is also a pair tester. We circumvent the lower bound 
(on n) of [3] because of the worse dependence on e. 

1.2. Main Ideas. Our tester is a combination of the edge tester and what we call the path 
tester. The path tester essentially samples a random point x on the hypercube, performs 
a sufficiently long random length walk 'up' the directed hypercube to reach y, queries f{x) 
and f{y) and tests for monotonicity. 

Our algorithm is inspired by a recent paper by Ron et al. [12], which shows a 0{^/n)-queTJ 
randomized algorithm to estimate the average sensitivity of a monotone function. The 
algorithm essentially performs the operation above and counts the number of mismatches; 
Ron et al. [12] explicitly ask whether an algorithm "in the spirit" above can be used for 
monotonicity. Our answer is yes. 

Consider a function / is e-far from monotone. The aim of any tester is to detect a violation, 
that is, a pair x -< y such that 1 = f{x) > f{y) = 0. The success probability of the 
edge tester is exactly the fraction of violated edges. The intuition is that there are possibly 
many more violations that are "far away" and the directed random walk will help detect 
those. Consider the function / : {0,1}"+^ ^ {0,1}, /(0,a;) = if < n/2 - 1^ 
and 1 otherwise; /(l,x) = if |x| < n/2 -|- 2^fn and 1 otherwise. This function has a 
constant distance to monotonicity, and all the violated edges are of the form ((0,x), (l,a:)) 
for n/2 — 2i/n < |x| < n/2 -|- 2-v/n. The edge tester detects a violation with probability only 
0(l/n). Suppose we pick a uniform random point and perform of length < i < y/n. 

If the starting point has in the ffist coordinate and any of the 0(i/n) steps flip the ffist 
coordinate, the walk detects a violation. This happens with probability 0(1/^^), handily 
beating the edge tester. 
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The argument above required the violated edges to be ahgned along one dimension. We prove 
(in §2.2) that directed random walk detects a violation (with sufficiently high probability) 
when there is a large matching of violated edges. One of the ingredients of this proof is the 
following interesting combinatorial observation. In §2.1, we prove that if an e-fraction of 
the hypercube is marked blue, then the probability that the random walk starts and ends 
at a blue point is f2(£^). It shows that the endpoints of this random walk, which are highly 
correlated, behave like two independent samples as far as being blue is concerned. 

But what if no large matching of violated edges exists? Take the 'anti-majority' function, 
which is far from monotone, yet only has a matching of violated edges of size Q{2"'/^/n). 
This is dealt with by our dichotomy theorem. In §2.3, we prove that for any s > 0, either 
there exists Q{se2") violated edges, or there exists a matching of Q{e2^/s) violated edges. 
With this we are done; in the former case, the edge tester suffices, in the latter the path 
tester suffices. The factor n^^^ is the optimal tradeoff obtained by our approach. 

The proof of our dichotomy theorem combines two ideas discovered earlier in the context of 
monotonicity testing. The ffist is a theorem of Lehman and Ron [10] on multiple source-sink 
routing over the hypercube. The second is an alternating paths machinery developed by the 
authors in a separate work [4] for general range monotonicity. 

2. The Tester and its Analysis 

Fix a parameter e > 0. Our tester accepts if the function is monotone, and rejects with 
probability > 2/3 if / is e-far from being monotone. We assume without loss of generality 
that e < 1/2 since any function can be made monotone by changing at most 1/2 of its values. 
Set parameter i := 2\C^^fn ] , where = A/l01n(l/e). Since e < 1/2, we have > 2 and 
i > Ii denotes the index set [n/2 - i/2, n/2 + £/2]. 

For a binary vector x,?/ G {0, 1}", |x| is the number of I's in x and ||a; — is the £i-distance 
between x and y. The all zeros and all ones vectors are denoted 0" and 1", respectively. 
The directed hypercube is the directed graph with vertex set {0, 1}", and an arc from x 
to y if X -< y and \\y — x\\i = 1. Throughout the paper, u.a.r. stands for 'uniformly at 
random' . 

We describe a random walk based procedure called the path tester. 
Path Tester. 

(1) Let V be the collection of paths in the directed hypercube from 0"^ to 1"^. Pick a path 
p G "P u.a.r. Let Xp := {z E p : \z\ G Ii}. 

(2) Sample x G Xp u.a.r. 

(3) Let Yp(x) := {z G Xp : — a;||i > 1}. Sample y G Yp{x) u.a.r. 

(4) Reject if {x,y) violate monotonicity; i.e. /(x) < f{y),x y y or f{x) > f{y),x -< y. 

This is clearly a pair tester (and is hence non-adaptive and one-sided). Our final tester runs 
either the path tester or the edge tester, each with probability 1/2. 
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The heart of our work hes in lower bounding the probabihty of rejection when the function 
/ is e-far from monotonicity. Henceforth, we assume the function / is £-far, and we call the 
rejection event a success. Recall, since / is £-far from monotonicity, any maximal set M of 
disjoint, violating pairs satisfies \M\ > eT^~^ [7]. We refer to M as a matching of violated 
pairs. 

For 1 < 2 < n, Lj := {a; G {0, 1}" : |a;| = i\ denotes the zth layer of the directed hypercube. 
We refer to IJie/^ middle layers of the hypercube. 

Proposition 2.1. (a) | [J.^j^Li\ < £^2". (b) A u.a.r path p contains a u.a.r vertex from Li. 

Proof. By Chernoff bounds, for a u.a.r x G {0, 1}", Pr[||a;| — n/2\ > 1/2] < 2e"^^/^". Since 
i = 2\^Jl{}n log(l/£:) ] , this probability is at most . For the second part, observe the 
number of paths in V that pass through a given vertex x depends solely on □ 



2.1. Going from blue to blue. Assume e-fraction of the hypercube is colored blue. Let 
(x, y) be a random pair sampled by the path tester, and let £ be the event that both x and 
y are blue. If x and y were chosen independently u.a.r., then the probability of both being 
blue is e^. The following lemma shows that this probability does not degrade much even 
though X and y are correlated (for instance, they form an ancestor-descendant pair). 

/ £^ 

Lemma 2.1. Prf^l = Vt — — - 

Vln(l/e, 

Proof. For notational convenience, set := e/lGCg. This implies^ |Xp| — |yp(x)| < /i£ for 
any x G p. Let 6(p) be the random variable denoting the number of blue points in Xp 
corresponding to a random path p. Let 8^ and £y be the probabilities that the first and 
second points are blue; that \s £ = £x /\ £y. Abusing notation, p will also denote the event 
that p is the sampled path. 

Conditioned on a path p being sampled, the probability of the first point x sampled by the 
path tester being blue, that is Pt[£x \ p] = &(p)/^- 

Conditioned on the path being p and the first point being x (irrespective of it being blue or 
not), the probability that the second point y is blue is the number of blue points in Y-p{x) 
divided by |Fp(a;)|. The number of blue points is at least 6(p) — /i£ since |Xp| — |yp(a;)| < 
Therefore, 

Pr[^, I p,a; first point] = '^^^^ ^^^^^^^j "^^^^^^ > Mpi, 
Since the above inequality holds for all x (in particular all the blue x's). 



^The curious reader may be wonder why we have a "—1" in the distance condition for Yp{x) in the 
description of the path tester. This is a technicahty so that we have the bound |Xp| — |yp(a;)| < jit. Without 
the —1, the bound would be /i£ + 2 and can be made 0{fi£) only for large enough e. So instead of enforcing 
such a condition or carrying around a +2, the "—1" allows for a cleaner presentation. 
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Together, we get 

Pt[S] = J]Pr[f,|p,fJ-Pr[f,.|p]-Pr[p] 

pev 

— i-pi 2-^ 



pev 



V\^\iJ \v\ 



In the following, we use E[...] to denote the expectation over the choice of the path p. 
Claim 2.1. E[b{p)/i] := ^ Epep(Kp)/^) > e/8C,. 



Proof. Note that, for all i, < (^"2) ^ Let be the number of blue vertices in 
layer Lj and Zi be the indicator variable for the zth layer vertex in p being blue. Hence, 
6(p) = Ylieii, ^i- For all i, a p chosen u.a.r from V contains a uniform random vertex in 
layer Lj. (Prop. 2.1). Thus, 

E[Z.] = A>— 

By the tail bound of Prop. 2.1, the number of vertices not in the middle layers is at most 
e^2". Since e < 1/2, JZiei^ ni > {e — 2e^)2"' > e2"~^. Using linearity of expectation and the 
boun i < ^C^^, 

iGli 

□ 



We express the bound of (1) in terms of expectations and apply Jensen's inequality. 

Pr[^] > E[(6(p)/£)V/"E[6(p)/£] 

> {E[b{p)/i]r - f^E[b{p)/i] = E[bip)/imb{p)/e] - f,) 

The function h{x) = x{x — n) is increasing when x > fi/2. Hence, we can apply the lower 
bound of Claim 2.1 to get Pr[£^] > (e/8Ce)(£/8C£ — /x). Plugging back /x = e/lQC^, we 
complete the proof of Lemma 2.1. □ 



2.2. Large violated-edge matchings are good. We bound the success of the path tester 
when a large matching of violated edges exists. 

Lemma 2.2. Suppose there exists a matching E of violated edges in the middle layers of the 
hypercube. Then the path tester succeeds with probability 
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Proof. Wc begin with some notation. Denote Bq = {x & B : f(x) — 0} and Bi = {x ^ 
B : f{x) = 1}. Note that \Bq\ = \Bi\ = \E\. For any two points x,y, let S^^y denote the 
event that the path tester picks {x,y). For convenience, in what follows, the pairs of E will 
be ordered according to the directed hypercube (so if {z,z') e E, then z ~< z'). We abuse 
notation to define £^ as a function. For edge (z, z') e E, we set E{z) — z' and E{z') — z. 

We define the following sets of pairs of vertices. 

ei 

y = {{x,y)\x -< y, \\x-y\\i > - l,xe Bi,y e Bi} 

V'^{{x,E{ymx,y)eV} 

A few observations. V lies in the support of the pair tester, that is, pairs {x, y) sampled with 
non-zero probability. Every pair in V is a violation; for (x, y) G V ^ we have x -< y, y E Bi 
implying E{y) >- y and E{y) G Bq. Finally, the mapping {x,y) G V to {x,E{y)) G V is 
one-to-one. This uses the fact that E is a matching (and is a crucial piece of the proof). 

Since all pairs in V are violations, 

Pr[ success 

(x,y')eV' 

Using the mapping between V and V and that Pr[£a;^j^] > for (x, y) G V, 
{x,y')ev' {x,y')ev' ^ ^,^(y )i 

(x,y)€V L ^'^J 

We break the remaining proof into simpler claims. For a vertex x, define s{x) := \Yp{x)\ = 
\{z G Xp : ll^; — a;||i > e£/32Ce — 1}| where p is some path containing x. This is well-defined 
since |^(x)| is independent of p for any p 3 x. In fact, 

ei 

^e/,:H-|x||>32^-l 



(3) s{x) 
The following claim is just a routine calculation. 

32Ce ' ^'f 



Claim 2.2. Suppose x,y are in the middle layers and \\x — y\\i > — 1. Let Vx,y denote 



the set of paths containing both x and y. Define 

- J \s{x) + s{y) 

Then, 



(4) Pr[^.,.] = e.. 



\V I 



x,y 



Proof. Note that 

Pr[^a;,2/] = Y^ Pr[p sampled ] • Pr[x, y sampled | p sampled 



p:a;,j/ep 
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Since — y\\i > 1, ?/ G Yp{x) (and vice versa). Suppose x is the first point to be 

sampled; this happens with probabihty The probabihty y is the second point sampled 
is jy^;^- Arguing analogously when y is sampled first, when x,y E Xp, 

Pr[x, 2/ sampled | p sampled ] = j (^ \y\x)\ ^ |y " ^^'y 

The proof concludes by noting that ^p.^; j^gp Pr[p sampled] = ■'^rp- □ 

The next claim shows that for any {x, y) G V, dx,E{y) is almost as large as O^^y 
Claim 2.3. For {x,y) G V, e,^E{y) > (l - ^) 0,^y 

Proof. For convenience, let y' denote E{y); note that y' >~ y and |?/'| = |?/| + 1. There exists 
some path containing x,y, and y'. From (3), s{y) < s{y') < s{y) + 1. 



Putting it all together. 



0.,y six)-^ + siyy^ - s{y)-^ " s{y) + 1 

c+g ^ a 
c+b — b 



The first inequality follows from the observation > j- whenever a < b and c > 0. Since 



Claim 2.4. For {x,y) G V, 

I 

Proof. Combining Claim 2.2 and Claim 2.3, 

Pr[^x,i;fa)] ^ Ox,E{y)\Vx,E{y)\ ^ A I \ \'Px,E{y)\ 



P^\£x,y\ (^x,y\T^x,y\ \ / I'Px 



x,y\ 



We know exactly what both the numbers in the RHS are. Say |x| = t and \y\ = t + s. Note 
s > ee/32Ce - 1. Also note \E{y) \ = \y\ + 1. Then, 

\Vx,y\ = t\s\{n - s - t)l and = t!(s + l)!(n - s - t - 1)! 

Plugging in (5), 

The denominator is Q{n) since n/2 — Cg^^Jn < \y\ < n/2 + C^^fn. The numerator is at least 
el/?>2Ce = Q{ey/n), completing the proof. □ 

Going back to (2), 

Pr[ success ]> ^^i^^^y^^^pT = ^{^ ' E 



ix,y)eV ^ ^ {x,y)€V 

Now for the punchline. Color all points in Bi blue. By Lemma 2.1, the probability that the 
random path tester samples a pair {x,y) such that both points are blue is n((|i?i|/2")^). 
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This is the event that x -< y (or y -< x) , \\y — x\\i > — 1, and x,y E Bi. The probabihty 
of this event is exactly twice J2{x y)ev^^[^^,y\- Hence, 

~ I e /|i?l\2" 

Pr[ success ] = 



n V 2" 



□ 

2.3. Wrapping it up with the dichotomy. We state the dichotomy theorem between the 
total number of violated edges and largest matching of violated edges. 

Theorem 3. For any function f that is e-far from monotone, and for any s > 0, either 
there is a matching of violated edges completely contained in the middle layers of cardinality 
at least e2"'~^/s, or the total number of violated edges is at least £:s2"~^. 

Proof. For any matching M of violated pairs, the average length of M is defined to be the 
quantity 

Choose M to be maximum cardinality matching of violated pairs with the smallest average 
length, denoted by r. Since / is e-far from being monotone, and M is a maximal matching, 
we know that |M| > e2"~^. The proof follows from the following two lemmas that we prove 
in subsequent subsections. □ 

Lemma 3.1. // the average length of M is r, then there exists a matching E of violated 
edges in the middle layers of the hypercube with size at least e2"'~^/r. 

Proof. Deferred to §2.4. □ 
Lemma 3.2. // the average length of M is r, then there are at least r62"-^^ violated edges. 

Proof. Deferred to §2.5. □ 

It is now routine to prove Theorem 1 and Theorem 2. 

Proof of Theorem 1: Set s = rv'l^e^l'^. Suppose the largest matching in the middle 
layers has size at least e2'^~^/s. By Lemma 2.2, we get that the path tester succeeds with 
probability 

e /e\'' 



n 



- 



^/n \sJ 

If the largest matching is at most e2'^~^/s, then by Theorems there are at least £s2"~^ 
violated edges. The edge tester succeeds with probability ^l{s€/n) = ^l{n~^^^€^^^). □ 

Proof of Theorem 2: The number of violated edges is at most the number of influential 
edges, I(/)2"~^. From Theorems, with s = I{f)/6 in Theorems, we know there exists a 
violated edge matching of cardinality f2(£^2"/I(/)) in the middle layers. By Lemma 2.2, the 
path tester succeeds with probability f2(n~^/^£:^/I^(/))). We do not need the edge tester for 
these functions. □ 
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2.4. Proof of Lemma 3.1. 

We first state the routing theorem of Lehman and Ron. 

Theorem 4 (Lehman-Ron [10]). Let S,TZ be two subsets of points of the hypercube such 
that \S\ = \7l\ = m and all points in S (respectively, 71) have j (respectively, i) ones, with 
i < j. Furthermore, suppose there is a mapping (p : S ^ IZ such that S >- 0(5'), WS e S, 
that is, {S,(f){S)) are ancestor-descendants. Then there exists m disjoint saturated chains 
(vertex disjoint paths) that contain all of S and TZ. 

For convenience, we represent the matching through ordered pairs, so if (x, y) G M, then 
X ~< y (and f{x) = 1, f{y) = 0). Recall M is a maximum cardinality matching in the 
violated graph with the smallest average length. Among such matchings, let M actually be 
one maximizing 

<i>(M):= J2 \\^-y\\i 

We prove a structural claim regarding M. Two pairs {x,y) and {x',y') cross if (a) there 

exists a z such that x z y and x' z y' and (b) the intervals \y\] and \y'\] 

strictly cross, meaning that neither interval contains the other. (By (a), the intervals \y\] 
and \y'\] must intersect.) 

Claim 4.1. There are no crossing pairs in M. 

Proof. Suppose {x, y) and {x', y') cross, then consider M' formed by deleting these pairs from 
M and adding (x, y') and {x\ y). Note that these are valid violations due to the presence of 
the vertex z. Furthermore, check that $(M') > $(M) since the sum of squares of a pair of 
numbers having a fixed sum increases as the maximum (of the pair) increases. □ 

For every two levels i < j of the hypercube, let Mjj C M be the pairs with endpoints in 
these level sets. Apply Theorem4 to get a collection of |Mj j| vertex disjoint paths. Each of 
these vertex disjoint paths contain at least one violated edge, and let Fij be the set of these 
edges. Note that Fij forms a matching. Consider the multiset F formed by the union of Fij 
over the set {(«, j) '■ i,j & le, i < j, j — < 2r}, containing duplicates. Note that all edges 
of F lie in the middle layers. 

Claim 4.2. |F| > |M|/4. 

Proof. Note that |F| = jyi jei^ i<j j-i<2r Since the matching M has average length 

r, by Markov's inequality, at least |M|/2 of these pairs have length at most 2r. Furthermore, 
from Prop. 2.1 we get that at most e^2"' < \M\/4 pairs in M have endpoints not in the middle 
layer. Looking at the remainder, we get J2(i,j):i,j&ie,i<j,j-i<2r ^ 1^1/4- □ 

Claim 4.3. No point z G {0, 1}"" has more than 2r edges of F incident on it. 

Proof. Pick a vertex z, and pick any two edges /i and /2 of F incident on it. Since each 
Fij is a matching, these must lie in different Fij^s. Suppose they are Fij and Fq ^, where i 
could be a and j could be b, but not both together. Note that i < \z\ < j and a < \z\ < b. 
We claim that [i,j] and [a,b] cannot cross, and therefore one must strictly lie in the other. 
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There can be at most 2r intervals containing \z\ satisfying such containment relationships. 
Thus, there can be most 2r edges of F incident on z. 

We now prove that [i, j] and [a, h] cannot cross. Consider the pairs in M; and let them be 
(xi, (x2, 1/2); • • • , {xki Uk)- Note that Theorem4 implies k vertex disjoint paths containing 
all these vertices. Hence, there is some permutation vr such that for each i G [/c], there is a 
path from Xi to y7r(i)- Let M[^ - = {(xj, 1/7^(4))}. Similarly, get M'^ f^. Let M' be the matching 
where Mjj and Ma^i, are replaced by M'^ j and M'^ f^ (and all other pairs remain). Note that 
M' has the same average length, same cardinality and $(M') = $(M). 

But now, we have a pair {x,y) G M^j such that x z y, and a pair {x',y') G M'^ f^ such 
that x' -< 2; -< y'. This is because z is incident to an edge in both Fij and Fa^b. If and 
[a, 6] cross, then {x,y) and {x',y') cross. Since M' maximizes the potential $, Claim 4.1 is 
contradicted. □ 

Since the multigraph induced by F has maximum degree 2r, there exists a matching E C F 
of size |£^| > |F|/4r. This can be obtained by picking an edge in E arbitrarily and deleting 
all edges incident to any of its endpoints from F. For every edge added to E there are 
at most 4r edges deleted from F. Therefore, following Claim 4.2, we get a matching E of 
violated edges of size > |M|/16r > e2^^^/r. Furthermore, they all lie in the middle layers. 
This completes the proof of Lemma 3.1. 

2.5. Proof of Lemma 3.2. 

Let Mi be the set of pairs in M that cross dimension i, that is, Mj := {{x,y) E M : Xi = 

0, yi = 1}. Lemma 3.2 follows from following theorem. 

Theorem 5. The number of violated edges across dimension i is at least |Mj|. 

Note that |Mj| = J2{x y)eM Wv ~ — since a pair {x,y) appears in precisely 

\y\ — \x\ different Mj's. Theorems implies that the number of violated edges is at least 
|Mj|, and therefore, since \M\ > £:2"~^. This completes the proof of Lemma 3.2. We now 
prove Theorems. 

Proof. (Proof of Theorem S.) The proof requires setting up some of the machinery of [4]. 
Let H be the perfect matching of the hypercube formed by the edges crossing the dimension 

1. Let X be the endpoints of Mj. For all a; G X, we define a sequence S^^ as follows. The 
first term Sx{0) is x. For even i, S^i^i + 1) = H{Sx{i))- For odd i, if Sx{i) G X, or is 
M-unmatched, then S^; terminates. Otherwise, Sx{i + 1) = M{Sx{i)). Above, we have 
used the shorthand M{v) and H{v) to denote the partners of v in the matchings M and 
H, respectively. (Observe that we use M and not Mj. The matching is only used to 
define X.) The best way to think about Sx is via alternating paths and cycles formed by 
the matchings M and H. We start at x and take the H-edge along the alternating path. We 
keep on moving till we reach an endpoint or another vertex in X. Thus, each S^, terminates. 
It is not hard to see that if ends at ?/ G X, then Sj^ is just S^, in reverse. Furthermore, 
Sx and Sy are disjoint unless y terminates S^^. Therefore the number of sequences is at least 
|X|/2 = \Mi\. The theorem therefore is a consequence of the following lemma which is the 
heart of the proof. □ 
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Lemma 5.1. For allx, contains a violated edge in H. 

Proof. We will prove this through contradiction, and will henceforth assume that (for some 
x), Sx has no violated edge in H. We show that Sx cannot terminate, completing the 
contradiction. For brevity, let us use Sj to denote Sx{i)- Let (x, y) be the pair in Mj. We use 
s_i to denote y. Wlog, assume x >- y, thus Xi = 1 and yi = 0. Also f{y) = 1 and f{x) = 
since the pair is a violation. 

Let Bfc [b = {0, 1}) be the n — 1 dimensional hypercube where ith coordinate is b. We will 
use d{x,x') for the Hamming distance between two points x and x'. We have the following 
simple claim. 

Claim 5.1. Let j > be an index and suppose Sj exists. For j = (mod 4), f{sj) = and 
Sj G Bi; j = 1 (mod4), f{sj) = and Sj G Bg; j = 2 (mod4), f{sj) = 1 and Sj G Bq; 
j = 3 (mod 4), f{sj) = 1 and Sj G Bi. 

Proof. We prove by induction on j. For the base case, Sq = x, and f{x) = 0, Sq G Bi. 
Consider j = (mod4), j > 1. By the induction hypothesis, /(sj_i) = 1 and Sj_i G Bi. 
Since Sj = M(sj_i) and (sj_i,Sj) is a violation, f{sj) = and Sj G Bi. Consider j = 
1 (mod 4). By the induction hypothesis, /(sj_i) = and sj-i G Bi. Since Sj = H{sj-i) and 
{sj-i, Sj) is not a violation, f{sj) = and Sj G Bq. The remaining cases are analogous. □ 

Claim 5.2. Let j > be even. Then (sj,Sj+3) is a violation and d{sj,Sj^3) = d{sj+i, Sj+2) ■ 
Also, the pair {y, si) is a violation and d{y, si) = d{y, sq) — 1. 

Proof. Suppose j = 2 (mod 4). Then from the above claim f{sj) = 1 and /(sj+a) = 0. 
The claim also implies /(sj+i) = 1 and since (sj+i, Sj+2) G M, >- Sj+i. Furthermore, 
both Sj+i, Sj4.2 G Bi. Since Sj = H{sj+i) and = if(sj+2), we get (a) Sj+3 >- sj as well, 
implying {sj, Sj+3) is a violation, and (b) d{sj, Sj+3) = c?(sj+i, Sj+2). The case j = (mod 4) 
is analogous. 

Observe that (y, Sq) is a violation where y E Bq and Sq G Bi. Since Si = H{so), Si has 
the same coordinates as sq except for the ith one. Also, f{si) = 0. Therefore, {y, si) is a 
violation and d{y, si) = d{y, sq) — 1. □ 

We will now argue that S^^ cannot terminate. Consider Sj for j = 1 (mod 4). Because 
f{sj) = and Sj G Bq, Sj cannot participate in a pair in Mj. Hence, Sj ^ X. Suppose Sj was 
unmatched. Consider the following set of pairs in M: A = {(s^, Sk+i)\k odd, — 1 < A; < j— 2}. 
Suppose we replaced these pairs in M by S = {(?/, si)} U {{sk, Sk+3)\k even, < A; < j — 3}. 
Note that \A\ = \j/2] = \B\. Also, by Claim 5.2 



This means that replacing A and inserting B (in M) leads to a violation matching of the same 
size with a smaller Hamming distance. This violates the property of M (that of minimum 
average Hamming distance), and therefore, the sequence cannot terminate. This cannot 
occur, and therefore, every S^; must contain a violated edge in H. This ends the proof of 




fe=i 

k odd 



k odd 



Lemma 5.1. 
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3. Conclusion 

In this paper, we make progress on the question of testing monotonicity of Boolean functions 
over the hypercube. Theorems, Lemma2.2 and Lemma2.1 are tight, and the exponent of 5/6 
is the best we can hope for using our analysis. Our approach in general falls short of the ^Jn 
bound. Nevertheless, we believe the path tester (alone) is a 0(y^)-query monotonicity tester 
for Boolean functions. A possible approach is suggested by Theorem 2. Can we perform a 
different analysis (or even a different algorithm) for high average sensitivity functions? 
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